<<<<<<< HEAD Ablation study of forester: Paper plots

This is the notebook where the visualizations from ablation_study_results_analysis were enhanced and modified for the needs of paper.

1 Imports and settings

library(ggplot2)
library(patchwork)
library(scales)

2 Data import

duration_train_df               <- readRDS('ablation_processed_results/training_duration.RData')
duration_preprocessing          <- readRDS('ablation_processed_results/preprocessing_duration.RData')
extended_training_summary_table <- readRDS('ablation_processed_results/extended_training_summary_table.RData')

3 Time analysis

duration_df                                 <- duration_train_df
full_duration                               <- duration_preprocessing$Duration + duration_df$Duration
duration_df$Preprocessing_duration          <- duration_preprocessing$Duration
duration_df$Preprocessing_duration_fraction <- round(duration_df$Preprocessing_duration / full_duration, 3)
duration_df$Full_duration                   <- full_duration
rmarkdown::paged_table(duration_df)

3.1 General time complexity

3.2 Preprocessing time complexity

3.2.1 Appendix Imputation

3.3 Feature selection time complexity

## Warning: Removed 1 rows containing non-finite values (`stat_boxplot()`).

4 Performance

4.1 Advanced preprocessing vs Baseline

4.2 FS impact on performance

4.3 Removal impact on performance

4.3.1 Binary Classification

4.3.2 Binary classification

======= Ablation study of forester: Paper plots

This is the notebook where the visualizations from ablation_study_results_analysis were enhanced and modified for the needs of paper.

1 Imports and settings

library(ggplot2)
library(patchwork)
library(scales)

2 Data import

duration_train_df               <- readRDS('ablation_processed_results/training_duration.RData')
duration_preprocessing          <- readRDS('ablation_processed_results/preprocessing_duration.RData')
extended_training_summary_table <- readRDS('ablation_processed_results/extended_training_summary_table.RData')

3 Time analysis

duration_df                                 <- duration_train_df
full_duration                               <- duration_preprocessing$Duration + duration_df$Duration
duration_df$Preprocessing_duration          <- duration_preprocessing$Duration
duration_df$Preprocessing_duration_fraction <- round(duration_df$Preprocessing_duration / full_duration, 3)
duration_df$Full_duration                   <- full_duration
rmarkdown::paged_table(duration_df)

3.1 General time complexity

3.2 Preprocessing time complexity

3.2.1 Appendix Imputation

3.3 Feature selection time complexity

## Warning: Removed 1 rows containing non-finite values (`stat_boxplot()`).

4 Performance

4.1 Advanced preprocessing vs Baseline

4.2 FS impact on performance

4.3 Removal impact on performance

4.3.1 Binary Classification

4.3.2 Binary classification

>>>>>>> b6c9e7735ce229d9a94dce9db6fcedec62936c73